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New Claims 



The new claim 1 is fomied from the original claim 1 and the wording of the new claim 1 is 
clarified. In detail, the subject-matter as defined by the independent claim 1 is operable 
with hybrid speech coders and hybrid speech decoders set forth with reference to the 
original claim 22, page 9, lines 6 - 9 and page 25, lines 1 - 4 as well as claim 24, page 21 - 
24 and page 25, lines 1 - 4 of the specification. The purpose of the derived phase- 
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characterizing parameter to prevent misahgnment between the preceding and the 
subsequent frames is disclosed on page 4, lines 6 - 10 of the present spec. 

In order to state the sequence of the frames in question defined in the independent claim 1 
more precisely, the frame coded according to the waveform matching speech coding is 
designated as preceding frame, whereas the frame according to the parametric speech 
coding is designated as subsequent frame. A corresponding disclosure relating the 
preceding frame coded according to the waveform matching speech coding can be 
obtained from page 4, lines 1- 2. Moreover, a disclosure relating to the subsequent frame 
coded according to the parametric speech coding can be gathered from the teaching on 
page 4, lines 6-10, which also discloses the immediate succession of the frames in 
question. ^ 

The new claims 2 to 11 correspond to the original claims 2 to 1 1 . 
The new independent claim 8 is adapted t^ 



, . .The new claims 12 to 27 correspond to, the original clairiis 12 to 27.? \, ' ^ ' 

, ;^ Thenmnberingmd back references of the new claims ^ , '\ 



> state oftrieArt -1 mm^^^^y ' : : / ^^ " :JJ >^ ^ 




« • .f; , > v v' ^Elecb^^ I^ttiers^ v6ir2i 685'- 688, May 2001, ISBN 0-7803-7041 -4>: "-r , 

•■•:■V•■■^ : ' ■""'^••T '-J?-- 

pi is the closest prior art dbcmnent relevant for tiie present invention. 

Dl relates to a hybrid coder with a phase model to synchronize harmonic and waveform 
coded segments. The coder of Dl is based oh an analysis by synthesis to distinguish 
between stationary and transitional segments. Harmonic excitation is synchronized with a 
linear (LPC) residual by transmitting the location of the pitch pulse closest to the frame 
boundary and a phase value that represents the shape of the corresponding pitch pulse. 
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III. Object 

A first object of the present invention is to provide a method and a device capable for 
operating the method for determining phase characteristics of a frame coded according to a 
waveform matching coding for providing the determined phase characteristics as an initial 
phase characteristics for parametric coding of a succeeding speech frame. 

A second object of the present invention is to provide a method and a device capable for 
operating the method for detecting misalignments between a preceding frame coded 
according to a waveform matching coding and a frame succeeding (immediately) the 
preceding frarne coded according to a parametric speech coding to enable a re-coding of 
the succeeding frame. 



IV. Solution 



The^^^ 

^^^^ <^ speech decoders,. : whi^A^^ :^r^ prpyides^ phase^i . 

r^'. 't^l^p^ai^^ wluch ■ -sucmIm^ rilali^ ^ 

^^^^^^^,^'^^^^^^^^^^^!^^^ ^'-^^ enal!)les taprev^^^ 

: \ :<ievice ;^ method evaluates characteristics /; < 5 ,.:/ 

V: >i kJS^^^^^ ^avefi^^ 

^ time to the wayefo^ 



irr^questiicm^ 

V; Noveltv^ Step ' '^• ' V ' 

Dl establishes the closest prior art relevant in view of the present invention. 

Dl teaches the synchronizing between harmonic and waveform coded segments, which is 
obtained by the means of the synchronized waveform matched phase model (SWPM) 
adapted to preserve time sjmchrony and waveform shape between original speech and 
synthesized speech. This methodology of Dl relates rather to a classical method to 
maintain synchrony between the synthesized, i.e. synthesized by a parametric speech 
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coder, and the original speech. In the fully voiced frames, where a parametric speech coder 
is employed, supplementary phase information is determined and transmitted to the speech 
decoder. This phase information is based on the pitch pulse location and pitch pulse shape 
in the original speech and in the voiced frame. In the succeeding voice frames the phase 
information is interpolated between the phase value at the end of the previous frame and 
the transmitted value for the current frame. 

In contrast to Dl, the present invention relates to transients between frames coded by a 
waveform coder and a parametric coder. To get the phase value in the first parametric 
frame, the estimated pitch pulse value in the waveform frame is exploited to ensure 
smooth transition. Consequently the synchrony between the original and coded speech 
needs not to be remained, which is asserted ais a basic concept of the teaching in 
accordance with Dl . The present invention reflects the requirement to guarantee a smooth 
transition between succeeding frames coded by waveform speech coding and parametric 
speech cpding. How to hsuidle this s^^ transition in terms 6f quality is 

not discussed in Dl since the te^ problem of 

phase continuation between the voiced frames. , ^ , i , . ^ ^ ^ - 

Fijtflier^^ tedmbiogy rd 

coinS^pl^^^^ 
illustraj^ 




the pitch cphtiiiuity is chieipked during the frahsintidh regjoris. , D 1 ; does not ;ir^ 




Conclusively, the methods, devices and system defined in the claims accompanying the 
present invention are not anticipated by the closest prior art documei^^ 

As aforementioned, the closest prior art document Dl provides a closed teaching which 
relates to the phase continuation between the voiced frames and solves the problem by 
transmitting supplementary phase information being based on the pitch pulse location and 
pitch pulse shape in the original speech and in the voiced frame. The phase information of 
Dl enables to ensure the synchrony between original and synthesized speech. Due to the 
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fact that Dl defines a closed teaching, the inventive concept and inventive teaching 
according to the present invention in neither included in Dl nor suggested by Dl. This 
fact is also visible to those skilled in the art when referring to the concept according to the 
present invention to detect and/or eliminate misalignments between frames in question at a 
network device disposed intermediately within the transmission link of terminals 
communicating said frames. 

Consequently, the claims being presently on file of the invention are likewise assumed as 
inventive over the closest state of the art document D 1 . 

VI. Requests 

The Examiner is kindly requested to reconsider its opinion about the patentability of the 
present invention in view of the arguments provided above and with respect to the 
accompanying set of new claims. ^ 




Dr: Thomas Kurig ^ 
(Patent Attorney) . 

Enclosure: : Set of new claims 1 to 27 • 



Applicant: Nokia Corporation 

Our Ref.: 50907 WO WO (KG/TP) 



New Claims 

1. Method for providing at least one phase-characterizing parameter for speech pi-ocessing 
operable with hybrid speech coders and hybrid speech decoders, comprising: 

- obtaining characteristics of a preceding frame coded according to a waveform matching 
speech coding; said preceding frame according to said waveform matching speech coding 
being immediately preceding in time to a succeeding frame according to a piarametric 
speech coding 

characterized by 

- ; derivuijg said at least pile phase-characteriziiig pariarneter for proc 

fixime according to Siaid parametric speech cb<^ obtained chiaracteristics; 

whereiii said at least one phase-characterizingv^a^ 
mis 



idframes. ' f •^ ?r:. ^ 'Iv':-^'' 

2r Method according to claim 1, wherein said speech prpcessing is a speech encoding operiation; ^ ^: . 




idiiig fra^ 

^;^,/CQmpnses:;^/--.:;.:.} ^ ' v''-*^^-?' \ ''*: -l^v^^' 'il-'"^'/^ 

'^ d^^ positions. ~ 

4-^ waveform m 
- detenhiiiihg a poisition of a last pulse of smd at 1^ 

5. Method according to claim 4, wherein said at least one pulse is at least one pitch pulse. . v 

6. Method according to claim 4 or claim 5, wherein said step of obtaining characteristics of said 
preceding frame according to a waveform matching speech coding comprises: 

determining a pulse value from the distances between said at least two pulses. 



7. Method according to claim 4 or claim 5, wherein said obtaining characteristics of said 
preceding frame according to a waveform matching speech coding comprises: 
- obtaining a pulse value from an antecedent frame. 



8. Method according to claim 6 or claim 7, wherein said at least one phase-characterizing 
parameter is obtained from said position of said last pulse relative to a size of said preceding 
frame according to said waveform matching speech coding in relation to said pulse value. 



9. Method according to anyone of the preceding claims, wherein said at least one phase- 
characterizing parameter is at least one phase value. 

10. Method according to anyone of the claims 2 to 9, wherein said determining of said positions 
comprises: 

- determining average energy values from said preceding frame according to said waveform 
matching speech coding and 

- eyaluatiiig said average energy values in order to determine positions of at least one local 
. -maximal energy value and ' .'^ ' : • 

r - assigning said positions of said kt least one local niaximal energy value to said positions 



li; Method; a^^^ v^ues ; > ^ 

12. Methdcfe fo^ inisalignm^ 

- ^^^accbirdiiig^ to avwavefbrm miatcnihg speech rcoding^^^^ 




• : ' pbtaihirig characteristics of said preceding fra^ 

speech .coding, : X ^ -V' 1^' .-V^^rf^^-'^'''^^^^^ 

- obtaining chiaracteristics of said succeeding frame according to ssdd param 
coding, and 

evaluating said obtained characteristics in order to detect said transition misalignment. 

13. Method according to claim 12, wherein said obtaining characteristics of said preceding frame 
according to said waveform matching speech coding comprises: 

- determining positions of at least one pulse from said preceding frame according to said 
waveform matching speech coding and 

- determining a position of a last pulse of said at least one pulse. 



and wherein said obtaining characteristics of said succeeding frame according to said 
parametric speech coding comprises: 

determining positions of at least one pulse from said succeedingframe according to said 

parametric speech coding and 
S - determining a position of a first pulse of said at least one pulse, 

14. Method according to claim 13, wherein said pulses are pitch pulses. 

15. Method according to claim 13 or claim 14, wherein said evaluating said obtained information 
10 comprises: 

- G|etermimng a distance of said position of said last pulse and said position of said first 
piilse and 

^9 - comparing said distance with a pulse value. 

15 . : 16. Method accprding to claim 15, wherein said pulse is obtairied by the step of: 

; ^ - deternuning said p^ value from distance$ of said' ^ preceding 
\ . frame according to said waveform matching speech coding. 

: . /i7- M 

20 -V 

. said parametric speech coding. ^ < , \ . \ 

- ? 5 iSv Mj^^ td^;anypne of the cIswols- XI id: said^ 

" '■■ \;-^^''P9srtiomxpm|m ■ " '''-'rt/^'^i^'-^^^^^^^ 

_25 : , ;-- i^^^ 'energy- values 'frbm'.sMd,fi^ att^- , '-^'i; ■ u i. -i^v^. 'f ' ■ •-' • 

^ \ ' V iii o^^ 

- assigning said positions of said at least one local inaximkl energy v^ue tp said positions 
of said at least one puls^^ ; ■ 

30 . . • ■ ■; ■ ' ■ . ■ ^ ■ 

19. Software tool for speech processing, comprising program code portions for carrying out the 
operations of any one of claims 1 to 18, when said program is implemented in a computer 
program for executing on a computer, a user terminal or a network device. 
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20. Computer program for speech processing, comprising program code section for carrjdng out 
the operations of any one of claims 1 to 18, when said program is run on a computer, a user 
terminal or a network device. 
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21. Computer program product for speech processing, wherein said computer program product is 
comprising program code sections stored on a computer readable medium for carrying out the 
method of any one of claims 1 to 18, when said program product is mn on a computer, a user 

5 temiinal or network device. 

22. Communication terminal device offering enhanced quality of transmitted speech data 
comprising a speech encoder including a parametric speech encoding unit, a waveform 
matching speech encoding unit, and a communication interface for communicating speech 

10 encoded data via a mobile commxmication network, wherein said speech encoder is able to 

operate the method for providing at least one phase-characterizing parameter for coding a 
succeeding frame according to a parametric speech coding according to anyone of the claims 

^ Itoll.-:. ■ , .. .. ... : 

15 23. Communication terminal device offering enhanced quality of transmitted speech data 
comprising a speech decoder including a parametric speech decoding unit and a waveform 
matching- speechv decoding^^^^^^ 

encoded data via a mobile communication networki: wherein said speech decoder is able to 
^ r operate' the method for detecting a transition misalignment in transition from a preceding 
20 ' frame according to a waveform matching speech coding to a succeeding frame according to a 
parametric speech coding according to anyone of the claims 12 to 18. . ^ 

■ 24. Terminal dievice iccording to claim 23,.sadd,speech decoder being additionally able to oper^^ 
the method for providing at least one phase-characterizing parWeter for coding 
25 frame according to a parametric speech coding according to anyone of the claims 1 to 11.^^^^^^ 

25. Network device offering enhanced quality of tr&isinitted speech , data ' comprising a 
communicatiori interface for receiving encoded speech data and, transmitting encoded speech 
data and an analyzing unit, said analyzing unif being able to operate the method for detecting 

30 a transition misalignment from a preceding frsime according to a waveform matching speech 

coding to a siicceeding franie according to a pianimetric ispeech coding according to anyone of 
the claims 12 to 18. 

26. Network device according to claim 22, said analyzing unit being additionally able to operate 
35 the method for providing at least one phase-characterizing parameter for coding a succeeding 

frame according to a parametric speech coding according to anyone of the claims 1 to 1 1 . 




5 



27. System offering enhanced quality of transmitted speech data comprising: 

a first terminal coniprising a speech encoder for encoding speech and a communication 
interface for transmitting encoded speech data, 

a first terminal comprising a speech decoder for decoding said encoded speech data and a 
communication interface for receiving said encoded speech data, 
- an intermediate network device offering enhanced quality of transmitted speech data 
according the anyone of the claims 25'to 26. 
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